Speaker Identification Using Gaussian Mixture Models
نویسندگان
چکیده
In this paper, the performance of Perceptual Linear Prediction (PLP) features has been compared with the performance of Linear Prediction Coefficient (LPC) features for speaker identification. Two classification techniques, Gaussian Mixture Models (GMM) and Vector Quantization (VQ) with Dynamic time wrapping (DTW) are used for classification of speakers based on their speech samples into respective classes. A database of fifty speakers, twenty one males and twenty nine females has been prepared in clean environment. The identification performance of PLP features is 3.6%better than LPC features with VQ and DTW classifier. PLP features have also shown 1.2% increment in identification performance over LPC features with GMM classifier. Keywords—Gaussian Mixture Models, Perceptual Linear Prediction, Linear Prediction Coefficient, Vector Quantization
منابع مشابه
Robust text-independent speaker identification using Gaussian mixture speaker models
This paper introduces and motivates the use of Gaussian mixture models (CMM) for robust text-independent speaker identification. The individual Gaussian components of a GMM are shown to represent some general speaker-dependent spectral shapes that are efTective for modeling speaker identity. The focus of this work is on applications which require high identification rates using short utterance ...
متن کاملText-independent speaker identification using Gaussian mixture bigram models
In this paper, a novel speaker modeling technique based on Gaussian mixture bigram model (GMBM) is introduced and evaluated for text-independent speaker identification (speaker-ID). GMBM is a stochastic framework that explores the context or time dependency of continuous observations from an information source. In view of the fact that speech features are correlated between successive frames, w...
متن کاملText-Independent Speaker Recognition Using Gaussian Mixture Models Final Term Paper Proposal
The proposed project is an implementation of speaker recognition systems, both identification and verification. The systems are built using Gaussian Mixture Models, as proposed in several papers from Douglas A. Reynolds. The use of Fractional Covariance Matrix is studied as an possible increase for the traditional recognition systems. keywords: speaker recognition; Gaussian Mixture Models; like...
متن کاملSpeaker Identification From Youtube Obtained Data
An efficient, and intuitive algorithm is presented for the identification of speakers from a long dataset (like YouTube long discussion, Cocktail party recorded audio or video).The goal of automatic speaker identification is to identify the number of different speakers and prepare a model for that speaker by extraction, characterization and speaker-specific information contained in the speech s...
متن کاملSpeaker Identification And Verification Of Noisy-Echoed Speech Using Gaussian Mixture Models
— The two major applications of speaker recognition applications are speaker verification and speaker identification. But in most of the cases the signal is corrupted with background interferences such as noise and echo. This paper proposes the method of speaker recognition and identification after the noise separation and echo cancellation. Support vector machine(svm) classification based sign...
متن کامل